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Accurate diagnosis of gastric intestinal metaplasia is important; however, conventional 
endoscopy is known to be an unreliable modality for diagnosing gastric intestinal 
metaplasia (IM). The aims of the study were to evaluate the interobserver variation in 
diagnosing IM by high-definition (HD) endoscopy and the diagnostic accuracy of this 
modality for IM among experienced and inexperienced endoscopists. Selected 50 cases, 
taken with HD endoscopy, were sent for a diagnostic inquiry of gastric IM through visual 
inspection to five experienced and five inexperienced endoscopists. The interobserver 
agreement between endoscopists was evaluated to verify the diagnostic reliability of HD 
endoscopy in diagnosing IM, and the diagnostic accuracy, sensitivity, and specificity were 
evaluated for validity of HD endoscopy in diagnosing IM. Interobserver agreement among 
the experienced endoscopists was "poor" (k = 0.38) and it was also "poor" (k = 0.33) 
among the inexperienced endoscopists. The diagnostic accuracy of the experienced 
endoscopists was superior to that of the inexperienced endoscopists (P= 0.003). Since 
diagnosis through visual inspection is unreliable in the diagnosis of IM, all suspicious areas 
for gastric IM should be considered to be biopsied. Furthermore, endoscopic experience 
and education are needed to raise the diagnostic accuracy of gastric IM. 
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INTRODUCTION 

Intestinal metaplasia (IM) of the gastric mucosa is an important 
premalignant lesion or condition (1, 2); hence it is expected 
that early detection of gastric cancer could be achieved through 
accurate diagnosis of IM with regular follow-up (3). However, 
the concordance of morphological appearance and histopath- 
ologic findings is low, and the accuracy of endoscopic diagno- 
sis is also known to be poor (4, 5). The reason for this is because 
ash-colored nodular changes spreading in the antrum, which is 
a typical endoscopic feature of IM is apt to be confused with 
antral nodular hyperplasia or raised erosion. In addition, flat 
type of IM is difficult to be found under the view of endoscopy 
alone (6). Therefore, accurate diagnosis of IM by means of en- 
doscopy is a major concern among those interested in the field 
of gastric cancer. 

Recently, a number of studies aimed at improving the rate of 
diagnostic rate of IM have been published. Conventional meth- 
ods ranging from multiple biopsies and methylene blue chro- 
moendoscopy to state-of-the-art narrow band imaging (NBI) 
and confocal laser endomicroscopy have been examined, and 
all yielded some promising results (7-10). Unfortunately all of 



these methods can create an increased cost or work load, espe- 
cially in the Asia-Pacific region where gastric cancer is preva- 
lent, making it nearly impossible to apply them widely. Thanks 
to advances in optical endoscopy techniques, even the mucosal 
structure can be visualized with high-definition (HD) endosco- 
py, and it has a similar detection rate to NBI or chromoendos- 
copy in diagnosing Barrett's esophagus and colon adenomas 
(11-13). Therefore it expected to be better than conventional 
endoscopy in accurately diagnosing gastric IM, even without 
any additional modality. To our knowledge, there has been no 
study of the diagnostic yield of HD endoscopy in diagnosing 
gastric IM. Hence the aims of this study were to evaluate 1) the 
interobserver variation in diagnosing gastric IM by HD endos- 
copy and 2) the diagnostic accuracy of this modality for gastric 
IM among experienced and inexperienced endoscopists. 

MATERIALS AND METHODS 
Overview of study design 

Selected 50 cases, taken with HD endoscopy, were sent for a di- 
agnostic inquiry of gastric IM through visual inspection to five 
experienced and five inexperienced endoscopists. We evaluat- 
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Selection of endoscopic cases 



50 cases (31 histopathologic IM) were selected from 1 ,596 cases, 
of which 1 case consisted of 4 high resolution still images. 



Request interpretation of selected cases 


5 experienced endoscopists 


5 inexperienced endoscopists 



Statistical analysis 

Interobserver variability and diagnostic accuracy of HD endoscopy for IM 



Fig. 1 . Flow diagram of the process of image evaluation. 

ed the interobserver agreement of the enrolled endoscopists in 
order to validate the diagnostic reliability of HD endoscopy in 
diagnosing IM. This was followed by the comparison of the di- 
agnostic accuracy between the experienced and inexperienced 
endoscopists group to find out whether endoscopic experience 
affects the diagnosis of IM. The study procedure is summarized 
in Fig. 1. 

Patients and endoscopic procedure 

Patients who received esophagogastroduodenoscopy (EGD) at 
Hanyang University Guri Hospital between October 2008 and 
September 2009 were included in the study. From 1,284 patients 
endoscopically diagnosed with IM in the endoscopic image da- 
tabase, and 648 patients histopathologically proven to have IM 
in the pathology database, a total of 1,596 patients (908 males, 
mean age 56 ± 14 yr) was finally enrolled after excluding 336 
patients whose data overlapped in the two databases. Accord- 
ingly, the study population consists of those who were endo- 
scopically diagnosed as gastric IM irrespective of histopatho- 
logic diagnosis, and those who were histopathologically diag- 
nosed as IM regardless of endoscopic diagnosis. 

All EGDs had been performed by one of four endoscopists. 
An HD endoscope (GIF-H260; Olympus Optical Co., Ltd, To- 
kyo, Japan) and high- definition television (HDTV) system (Evis 
Lucera; Olympus Optical Co., Ltd, Tokyo, Tapan) were used in 
all procedures. 

Selection of patients for interpretation 

We have considered the following conditions for the endoscop- 
ic cases of the patients referred for interpretation: first, the qual- 
ity of image should be that of a highest standard. Second, the 
level of quality of each image should be similar. Third, the im- 
ages should contain pictures that represent typical characteris- 
tics of the lesion. Fourth, histopathologic examination should 
be carried out to confirm the endoscopic diagnosis. To satisfy 
these conditions, we have designated the following selection 
criteria: 



Inclusion criteria 

1. Any case with more than 2 biopsies at a single site confirming 
the endoscopic diagnosis 

2. Any case with more than 3 vivid images from various angles 
including an image of the biopsy site 

3. Any case with more than a single vivid close-up image reveal- 
ing the fine mucosal surface of the biopsy site 

Exclusion criteria 

1 . Any cases with gastric cancer, gastric ulcer, or gastric polyp 

2. Any cases involving the use of additional diagnostic modali- 
ties such as chromoendoscopy or NBI 

3. Any cases with anatomical deformity due to conditions such 
as gastric surgery or chronic benign ulcer 

A total of 101 patients were selected from the 1,596 patients af- 
ter direct screening by a single gastroenterology faculty mem- 
ber and a single clinical fellow using the above criteria. Next, a 
final group of 50 cases was selected from 101 patients by simple 
random sampling using SPSS software version 18.0 (SPSS, Chi- 
cago, IL, USA). The finally selected 50 cases were composed of 
three different groups: a group diagnosed with IM both endo- 
scopically and histopathologically, a group diagnosed as IM 
endoscopically but not by histopathologic examination, and a 
group not initially diagnosed as IM through endoscopy but con- 
firmed as IM through histopathologic examination. 

Image evaluation study 

Production of endoscopic cases for interpretation: All endo- 
scopic images had vertical and horizontal resolutions of 300 
dots per inch, with a width of 900 and height of 780 pixels. The 
slides of the 50 cases were made into a slide show using Micro- 
soft Power Point 2007 (Microsoft, Redmond, WA, USA). Each 
case was composed of two slides: on the first slide the anatomi- 
cal position of the lesion where the biopsy taken was typed, and 
second slide endoscopic images were shown. The anatomical 
position shown on the first slide was written according to the 
minimal standard terminology 3.0 (14), and the stomach body 
was further subdivided into upper third, middle third, and low- 
er third. The second slide consisted of four endoscopic images, 
with one close-up image showing a clear mucosal structure of 
the biopsy site and three other images from different angles 
that included the biopsy site (Fig. 2). 

Assessors and interpretation of the selected cases: Five expe- 
rienced endoscopists were enrolled, of which two were assis- 
tant professors and the other three associate professors. The 
mean (SD) age of all faculty was 40.26 (1.94) yr and the mean 
(SD) duration of their endoscopic careers was 105.6 (33.30) 
months. The inexperienced endoscopists consisted of five clini- 
cal fellows, with mean (SD) age 33.84 (1.76) yr and mean (SD) 
duration of their endoscopic careers of 7.5 (1.36) months. The 
endoscopists were recruited from five different university-based 
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Fig. 2. Examples of a high-definition endoscopic case consisting of four images for interpretation. The close-up image shows the sharp mucosal surface of the biopsy site. The 
three other images include the site of biopsy from different angles and distances. (A) Nine out of 1 0 endoscopists agreed with the diagnosis of gastric IM on the anterior wall of 
lower body, and histopathology proved to be marked IM. (B) Two out of 10 endoscopists agreed with the diagnosis of gastric IM on the great curvature of antrum, and histopa- 
thology proved to be moderate IM. 



hospitals. 

The purpose and method of the study was explained in ad- 
vance to all the endoscopists, and 50 endoscopic cases were 
sent each endoscopist via e-mail. Endoscopic interpretation 
was performed according to the anatomical position of the le- 
sion presented by each case. There were no time limits in inter- 
pretation. If the endoscopist decided that a different finding 
was combined with IM, a diagnosis of IM was still to be made. 
The interpretation results were sent back via e-mail. 



ings in all the endoscopic cases were reviewed by two experi- 
enced gastrointestinal pathologists. The outcome parameters 
were sensitivity, specificity, overall accuracy, and diagnostic 
predictive value, of HD endoscopy. The diagnostic accuracy of 
the experienced and inexperienced endoscopists was com- 
pared in the following manner: each correct (true positive or 
true negative) score of an observer counted as +1 point; each 
incorrect score counted as 0 points. The sums of all scores were 
compared by the paired t-test (19). 



Statistical analysis 

Sample size was calculated in order to guarantee the design ac- 
curacy. We assumed the sensitivity of HD endoscopy to be aro- 
und 85% in diagnosing gastric IM. Furthermore, we considered 
that the lower 95% confidence limit should not fall below 0.65, 
with 0.95 probabilities. Therefore, the adequate sample size was 
calculated to be 50 (15). 

Statistical analysis of interobserver variability was performed 
with SPSS software version 18.0 (SPSS, Chicago, IL, USA). In- 
terobserver agreement was expressed as the percentage of full 
agreement among all observers, as well as by an overall k statis- 
tic with 95% confidence interval (95% CI) (16, 17). A k value gre- 
ater than 0.8 denoted excellent agreement, 0.8 to 0.6 denoted 
good agreement, 0.6 to 0.4 denoted fair agreement, and less than 
0.4 denoted poor agreement. A k value of 0 indicated agreement 
equal to chance, and a value less than 0 suggested disagreement 
(18). 

To calculate the accuracy in diagnosing IM, 2x2 tables were 
constructed to compare HD endoscopic diagnoses with histo- 
pathology as the reference standard. The histopathologic find- 



Ethics statement 

The study was approved by the institutional review board of 
Hanyang University Guri Hospital and by the local institutional 
review boards (IRB No. 2009-236). Informed consent was waiv- 
ed by the board. 

RESULTS 

Composition of the 50 endoscopy cases 

The composition of the 50 cases referred for interpretation to 
the 10 endoscopists was; 33 IM, 7 atrophic gastritis, 5 erosive 
gastritis, and 5 cases with other diagnoses according to the ini- 
tial endoscopic findings. The diagnoses of 31 IM, 11 chronic ac- 
tive gastritis, 5 chronic inflammation, and 3 with other diagno- 
ses were made according to the histopathologic diagnosis crite- 
ria. When the IM was categorized according to histopathologic 
severity, there were 1 1 mild, 14 moderate, and 6 marked cases. 

Interobserver variability 

The interobserver agreement in the experienced endoscopists 
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Table 1 . Interobserver variability between experienced and inexperienced endoscopists 



Subjects 


Experienced endoscopists 


Inexperienced endoscopists 


All endoscopists 


Full agreement (%) 


K-value (95% CI) 


Full agreement (%) 


K-value (95% CI) 


K-value (95% CI) 


All cases (n = 50) 


30 


0.38 (0.25-0.52) 


26 


0.33 (0.20-0.47) 


0.29(0.20-0.41) 


IM cases* (n = 31) 


26 


0.28(0.13-0.47) 


23 


0.27(0.12-0.47) 


0.22 (0.12-0.38) 


Mild (n = 1 1 ) 


27 


0.22(0.01-0.58) 


18 


0.22(0.01-0.58) 


0.20 (0.06-0.51) 


Moderate (n = 1 4) 


14 


0.21 (0.02-0.52) 


29 


0.35(0.12-0.64) 


0.22 (0.09-0.48) 


Marked (n = 6) 


50 


0.55(0.18-0.90) 


17 


0.36 (0.30-0.82) 


0.34 (0.10-0.78) 


Non-IM cases* (n = 19) 


37 


0.32 (0.13-0.57) 


32 


0.39(0.19-0.64) 


0.29 (0.15-0.51) 



'Cases which were histopathologically diagnosed. Histopathologic IM is categorized into mild, moderate, and marked according to severity. IM, Intestinal metaplasia; CI, confi- 
dence interval. 



Table 2. Diagnostic accuracy and predictive value of experienced and inexperienced 
endoscopists 



Diagnostic values 


Experienced 
endoscopists 


Inexperienced 
endoscopists 


All 

endoscopists 


Sensitivity 


74.2 


67.7 


61.3 


Specificity 


78.9 


52.6 


84.2 


Accuracy 


76.0* 


60.0* 


70.0 


Positive predictive value 


85.2 


70.0 


86.4 


Negative predictive value 


65.2 


50.0 


57.1 



All units are percent (%). *P< 0.05 between the experienced and inexperienced en- 
doscopists, compared with overall diagnostic accuracy. 



group for the 50 referred endoscopic cases was "poor" (k = 0.38; 
95% CI, 0.25-0.52) and it was also "poor" (k = 0.33; 0.20-0.47) in 
the inexperienced group. In addition, interobserver agreement 
among all endoscopists was also "poor" (Table 1). 

Interobserver agreement in both the experienced and inex- 
perienced group was also "poor" for the histologically proven 
31 IM cases, with K-values of 0.28 (0.13-0.47) and 0.27 (0.12-0.47) 
for the experienced and inexperienced group, respectively. Af- 
ter categorizing the IM according to severity into mild, moder- 
ate, and marked based on the Updated Sydney System (20), the 
interobserver agreement was "poor" for the mild (n = 11) and 
moderate (n = 14) cases in both groups. However, in the marked 
IM cases (n = 6), interobserver agreement in the experienced 
group was "fair to good" (k = 0.55; 0.18-0.90) but still "poor" in 
the inexperienced group. For the cases histologically proven to 
be non-IM (n = 19), the interobserver agreement for both groups 
was again "poor" with K-values of 0.32 (0.13-0.57) and 0.39 (0.19- 
0.64) for the experienced and inexperienced group, respectively. 

Diagnostic accuracy 

The sensitivity, specificity, and overall accuracy for IM of the 
experienced endoscopists were 74.2, 78.9, and 76.0%, respec- 
tively; while for the inexperienced endoscopists they were 67.7, 
52.6, and 60.0% respectively. In comparing diagnostic accuracy 
of the experienced and inexperienced endoscopists, the sum of 
score for the experienced endoscoipists was 161, and for the in- 
experienced endoscopists it was 139, indicating a higher diag- 
nostic accuracy rate for the experienced endoscopists (P = 0.003). 
The positive and negative predictive values for the experienced 



Table 3. Diagnostic accuracy and predictive value of each endoscopists 











Positive 


Negative 


Endoscopists 


Sensitivity 


Specificity 


Accuracy 


Predictive 


Predictive 










value 


value 


Experienced 


83.9 


63.2 


76.0 


78.8 


70.6 




74.2 


47.4 


64.0 


69.7 


52.9 




83.9 


63.2 


76.0 


78.8 


70.6 




45.2 


84.2 


60.0 


82.4 


48.5 




35.5 


94.7 


58.0 


91.7 


47.4 


Inexperienced 


58.1 


52.6 


56.0 


66.7 


43.5 




45.1 


68.4 


54.0 


70.0 


43.3 




80.7 


36.8 


64.0 


67.6 


53.6 




77.4 


26.3 


58.0 


63.2 


41.7 




51.6 


84.2 


64.0 


84.2 


51.6 



All units are percent (%). 



group were 85.2% and 65.2%, respectively, and 70.0% and 50.0%, 
respectively, for the inexperienced group (Table 2). Table 3 show- 
ed the value of diagnostic accuracy of each endoscopists. 

DISCUSSION 

In the present study, we have evaluated the interobserver agree- 
ment between endoscopists in order to verify the diagnostic re- 
liability of HD endoscopy in diagnosing IM, and further com- 
pared the diagnostic accuracy of experienced and inexperienc- 
ed endoscopists to find out whether the degree of experience 
affects the diagnostic accuracy. The interobserver agreement 
between both groups was poor, and the diagnostic accuracy of 
the experienced endoscopists was superior to that of the inex- 
perienced endoscopists. 

Endoscopes equipped with HDTV-compatible charge-cou- 
pled devices and HDTV video systems permit clear observation 
of even the fine structure of the mucosal surface, and are re- 
placing conventional endoscopy and becoming the standard 
equipment. In studies that compared the rate of detection of 
colon polyps with white-light (WL) and NBI using HD endos- 
copy, and the diagnostic rate of Barrett's esophagus with WL, 
chromoendoscopy, and NBI, HD endoscopy using only WL 
yielded similar results to the other additional enhancement 
techniques (11-13). In addition, although it is known that the 
value of conventional endoscopy for diagnosing gastric IM is 
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low, there has, until now, been no study of the diagnostic reli- 
ability of HD endoscopy, which provides high resolution imag- 
es (21). In the present study, interobserver agreement in diag- 
nosing gastric IM using WL HD endoscopic images was low in 
both the experienced (k = 0.38; 0.25-0.52) and inexperienced 
group (k = 0.33; 0.20-0.47). Therefore, although there have been 
great improvements in optical endoscopy since the invention 
of the endoscope, there remains a lingering doubt whether en- 
doscopy is trustworthy in diagnosing precancerous lesions or 
the condition known as gastric IM. Furthermore, the low in- 
terobserver agreement seems very meaningful since the expe- 
rienced endoscopists enrolled in the present study all had at 
least seven years of experience in screening endoscopy for gas- 
tric cancer in the Korean national cancer screening program for 
gastric cancer (22). The result of this study indicates that all sus- 
picious areas for gastric IM should be biopsied, and that diag- 
nosis through visual inspection is highly unreliable in the diag- 
nosis of IM. This is particularly true in regions with high preva- 
lence of gastric cancer. Therefore, our findings should be borne 
in mind when deciding on gastric cancer-related policies in coun- 
tries using endoscopy as a screening tool for people at high risk 
of gastric cancer. 

Variation in the quality of endoscopy likely has an important 
impact on patient outcomes, and then endoscopic training and 
experience should be needed for the high quality of endoscopy 
(23, 24). There have been several studies looking at the effects 
of endoscopic training and experience on endoscopic diagnos- 
tic accuracy, most of them confined to the diagnosis of colorec- 
tal polyps (19, 25, 26). To our knowledge, this is the first study 
that evaluates whether endoscopic experience influences the 
diagnostic yield in gastric IM. In this study, the overall diagnos- 
tic accuracy of the experienced endoscopists was superior to 
that of the inexperienced endoscopists (P = 0.003). In addition, 
in the analysis of interobserver agreement, the histopathologi- 
cally diagnosed marked IM cases by the experienced endosco- 
pists was the only instance which yielded "fair to good" (k = 0.55; 
0.18-0.90) interobserver agreement. This outcome indicates 
that HD endoscopy by experienced endoscopists may be diag- 
nostically acceptable in the limited category of cases classified 
as "marked" IM. Endoscopic experience is consequently need- 
ed in order to raise the diagnostic accuracy of gastric IM, and 
does have some influence on diagnostic reliability. 

This is the first study that validated the diagnostic reliability 
of HD endoscopy in diagnosing gastric IM, and emphasized the 
role of endoscopic experience on influencing the diagnostic ac- 
curacy. However, there are distinct limitations of our study. First, 
video recordings of the endoscopic procedures were not includ- 
ed in the slide show. Although video recording is much more 
realistic than still images, making it a great asset because of its 
resemblance to the environment of real-time endoscopy, in our 
study we used an endoscopic image database in order to access 



as many endoscopy cases as possible, thus missing out on any 
endoscopic video recordings that might be available. To make 
up for this, for every case we provided motionless images of the 
highest quality, consisting of three images from different angles 
with the biopsy site included, and a single close-up image show- 
ing a vivid pit pattern. Nevertheless, the diagnostic reliability or 
validity of HD endoscopy may be lower than expected since 
there are no video recordings included in the slide show. Sec- 
ond, the present study was a retrospective study which enrolled 
cases diagnosed as IM through either endoscopic finding or 
histopathological confirmation, which may have introduced 
some evidence of selection bias. Despite this fact, the interob- 
server agreement of both groups was poor. 

In conclusion, diagnosis through visual inspection under HD 
endoscopy is unreliable in the diagnosis of IM. Therefore, all 
suspicious areas for gastric IM should be considered to be bi- 
opsied. Furthermore, endoscopic experience and education 
are needed to raise the diagnostic accuracy of gastric IM. 
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